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PRODUCTION OF MULTIMERIC FUSION PROTEINS USING A C4BP SCAFFOLD 



Introduction. 

This invention relates to methods for producing high yields of 
5 fusion proteins and polypeptides comprising a C4bp domain in 
prokaryotic cells. 

Background of the Invention, 

The advent of recombinant DNA technology has provided the 
10 possibility of large scale production of biologically active 
proteins for therapeutic use. There are now many recombinant 
DNA produced products in the clinic or under development, 
including large proteins such as erythropoietin, small 
peptides, and antibody fragments. 

15 

It is known in the art that a difficulty with proteins is one 
of half life. Many proteins and peptides have a short half- 
life in vivo, reducing their usefulness. It has been found 
that multimerisation of protein and peptide molecules is a way 

20 of increasing the half-life of these molecules thus allowing 
them to exert their activity over a longer time scale. Many 
functional biological molecules have been found to be more 
potent in vivo when in the form of an oligomeric structure. 
This is due to factors such as binding with avidity rather 

25 than affinity, and/or the ability to cross-link molecules 

(e.g. identical receptor subunits as in the insulin receptor 
that are activated through dimerisation, or non-identical 
molecules to form signalling complexes on the cell surface, 
such as in lymphocytes) . These properties of increased half- 

30 life and avidity enable lower doses of the protein and peptide 
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molecules to be used, thereby reducing costs and dose- 
dependent side-effects. 

Different approaches have been proposed for making multimers 
5 of recombinant proteins. For example, chemical linkage of 
proteins to polymers such as polyethylene glycol has been 
attempted (Katre et al., (1987) Proc. Natl. Acad. Sci. USA 84, 
1487) . This technique, however, is cumbersome and requires 
large amounts of purified material. In antibody molecules, 
10 modifications of the disulphide-forming possibilities in the 
hinge, and other regions of the molecules have been attempted 
in order to modulate the extent to which antibodies will 
associate with each other. Results however have been 
inconsistent and unpredictable. Similarly, use of protein A 
15 fusions to generate multimeric antibodies may successfully 
link antibody fragments, but is of limited application in 
other fields. 

A new multimerisation system using the complement 4 binding 
10 protein (C4bp) is described in WO 91/11461. Human C4-binding 
protein (C4bp) is a plasma glycoprotein of high molecular mass 
(570 kDa) which has a spider like structure made of seven 
identical alpha-chains and a single beta-chain. The C4bp alpha 
chain has a C-terminal core region responsible for assembly of 
15 the molecule into a multimer. According to the standard 

model, the cysteine at position +498 of one C4bp monomer forms 
a disulphide bond with the cysteine at position +510 of 
another monomer. A minor form comprising only seven alpha- 
chains has also been found in human plasma. The natural 
10 function of this plasma glycoprotein is to inhibit the 
classical pathway of complement activation. 
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WO 91/11461 proposes that the ability of the C4bp protein to 
multimerise can be used to make fusion proteins comprising all 
or part of C4bp and a biological protein of interest. The 
fusion protein will form multimers which provides a platform 
5 for the protein of interest, in which said protein has an 

enhanced serum half-life and increased affinity or avidity for 
its targets. Fusion proteins of C4bp were targeted as the 
focus of novel delivery and carrier systems for therapeutic 
products in WO 91/11461. 

iO 

Most of the alpha-chain of C4bp is composed of eight tandemly 
arranged domains of approximately 60 amino acids in length 
known as complement control protein (CCP) repeats. Inclusion 
of one or more of these domains was preferred in the fusion 
15 proteins described in WO 91/11461, but it has since been 

demonstrated that all CCPs can be deleted (leaving only the C- 
terminal 57 amino acids) without preventing multimerisation 
(Libyh M. T. et aJ., (1997) Blood 90, 3978). This C-terminal 
region of C4bp is referred to as the C4bp core. 

!0 

Libyh et al, (1997), describe a protein multimerisation system 
which is based on the C-terminal part of the alpha chain of 
C4bp. The C-terminal part of the C4bp lacks biological 
function, but is responsible for polymerisation of C4bp in the 

>5 cytoplasm of CHO cells producing C4bp. Libyh et al. were able 
to induce spontaneous multimerisation of associated antibody 
fragments to create homomultimers of scFv fragments using the 
C4bp fragment- The C-terminal portion of C4bp used was placed 
C-terminal to the scFv sequence, optionally spaced by a MYC 

)0 tag. 

Oudin et ai. (2000, Journal of Immunology, 164, 1505) further 
use the C4bp core multimerising system for forming hetero- 
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multimeric multi CRl/scFv anti-Rh(D) molecules. The chimera 
proteins were expressed in a CHO cell line by co-transf ection 
of these cells and by two different vectors (one encoding CRl 
and the other encoding ScFv anti Rh-D) and were found to 
5 spontaneously multimerise in the cytoplasm of the transfected 
cells from which they were secreted. 

Christiansen et al. (2000, Journal of Virology, 74, 4672) 
further demonstrate the production of homo-multimeric fusion 
0 proteins encompassing the CD46 ectodomain linked to the C4bp 
core in 293 EBNA cells. 

Self-assembling multimeric soluble CD4-C4bp fusion protein 
have also been demonstrated in Shinya et al. (1999, Biomed & 
15 Pharmacother, 53, 471) where the fusion proteins were 
expressed in 293 cells. 

Shinya et ai. further suggest that the pharmacokinetic 
properties of fusion proteins containing the C4bp core domain 
10 are modified due to the increase in the in vivo plasma half- 
life of these recombinant fusion proteins in mice. As the core 
domain used is of human origin, adverse immunological 
consequences from its administration to humans would be 
minimised. 

>5 

To date, fusion proteins based on C4bp core protein have been 
expressed in eukaryotic cells. The yields of fusion protein 
from eukaryotic cells has rarely reached 2 micrograms per 
millilitre of culture supernatant (Oudin et ai. ihid) and this 
)0 could be achieved only after rounds of gene amplification. 
This level is too low for the economic production of large 
quantities of many fusion protein for therapeutic use. 
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One possible way of achieving higher yields would be to use a 
prokaryotic expression system. WO91/00567 suggests that 
prokaryotic host cells may be used in the production of C4bp- 
based proteins, though there is no experimental demonstration 

5 of any such production. A number of considerations however, 
would suggest that the use of prokaryotic systems would be 
disadvantageous. In particular, many eukaryotic proteins 
lose some or all of their active folded structure when 
expressed in cells such as Escherichia coli. Other eukaryotic 

10 proteins denature or are completely inactive when expressed in 
prokaryotic cells. 

C4bp is a secreted protein in mammals, and these are known in 
the art to be particularly difficult to produce in a correctly 

15 folded form in prokaryotes. Proteins with disulphide bridges 
are particularly problematic, as are those that require 
oligomerisation, Disulphide bonds are not normally produced in 
the reducing environment of the bacterial cytoplasm, and when 
they can form, they can stabilise misfolded or aggregated 

*0 forms of the protein. 

Usually, recombinant proteins expressed in prokaryotes are 
aggregated inside inclusion bodies within the host prokaryotic 
cell. These are discrete particles or globules separate from 

55 the rest of the cell which contain the expressed proteins 

usually in an agglomerated or inactive form. The presence of 
the expressed protein in the inclusion bodies makes it 
difficult to recover the protein in active soluble form as the 
necessary refolding techniques are techniques are inefficient 

50 and costly. Proteins purified from inclusion bodies have to be 
laboriously manipulated, denatured and refolded to obtain 
active functional proteins at relatively poor yields. 
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With regard to expressing C4bp core fusion proteins in 
prokaryotic cells, other considerations have also to be taken 
into account. Firstly, each core monomer retains two cysteine 
residues, and according to the model of C4bp multimers 

5 accepted in the art, these cysteines are required to form 
inter-molecular disulphide bonds during the assembly of 
multimers. The reducing environment of the prokaryotic 
cytosol such as the bacterial cytosol would be expected to 
prevent the formation of C4bp core multimers by reducing these 

10 disulphide bonds. 

Secondly, multimers are assembled during passage through the 
eukaryotic secretion apparatus, which is known to assist 
protein folding in ways not available in prokaryotes (e.. g. the 
15 presence of protein disulphide isomerase and unique 

chaperones) . Thirdly, even under conditions where relatively 
small yields were obtained in eukaryotic cells (micrograms per 
millilitre) , this secretory pathway is unable to produce 
homogenous protein. 

>0 

Summary of the Invention, 

The inventors have surprisingly found that fusion proteins of 
C4bp core are not only efficiently synthesized in prokaryotic 
cells but that the C4bp core itself is capable of folding 
25 correctly, and assembling into homogeneous multimers in the 
reducing environment of the prokaryotic cytosol. The 
multimers of C4bp core which are produced in prokaryotic cells 
surprisingly have been found to contain disulphide bonds • 

30 Further, the inventors have also found that proteins fused to 
the C4bp core produced in the prokaryotic expression systems 
retain their functional activity. The present invention 
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therefore provides a method for obtaining a recombinant fusion 
protein comprising a scaffold of a C-terminal core protein of 
C4bp alpha chain, said recombinant fusion protein being 
capable of forming multimers in soluble form in the cytosol of 
5 a prokaryotic host cell, the method including the steps of 

(i) providing a prokaryotic host cell carrying a nucleic 
acid encoding said recombinant protein operably linked to 
a promoter functional in said prokaryotic cell; 

(ii) culturing the host cell under conditions wherein 
10 said recombinant protein is expressed; and 

(iii) recovering the recombinant protein wherein said 
protein is recovered in multimeric form without 
performing a scaffold refolding step. 

15 We have found that the yield of protein in cell cultures of 

the invention can be relatively high, for example greater than 
2 mg/1 of culture, such as greater than 5 mg/1 of culture, 
preferably greater than 10 mg/1 of culture, such as greater 
than 20 mg/1 culture, and even more preferably greater than 

10 100 mg/1 culture, 

C4bp core fusion proteins of the invention comprise a C4bp 
core protein sequence fused, at the N- or C-terminus, to a 
biologically active sequence of interest. 

15 

Description of the Drawings. 

Figure 1 shows an alignment of C4bp sequences from different 
species. 
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Figure 2 shows purification of the fusion protein db-C4bp 
(where db is a peptide described in Example 1) from an ion- 
exchange column. 
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Figure 3 shows further purification of db-C4bp on a second 
ion-exchange column, 

5 Figure 4 shows purification of db-C4bp on a gel chromatography 
column. 

Figure 5 shows purification of db-C4bp on an ion-exchange 
column following a heating step. 

10 

Figure 6 shows further purification of db-C4bp on a gel 
chromatography column. 

Figure 7 shows the activity of DsbA-C4bp in an insulin assay. 

15 

Figure 8 shows the sequence of the promoter and C4bp coding 
' region in pAVD77, 

Figure 9 shows analysis of C4bp fusion proteins under non- 
>0 reducing conditions. 

Detailed Description of the Invention. 
Core protein of C4bp alpha chain. 

This is referred to herein as the ''C4bp core protein" or ^'core 
>5 protein", or ''C4bp scaffold". The terms are used 

interchangeably. This protein may be a mammalian C4bp core 
protein or a fragment thereof capable of forming multimers, or 
a synthetic variant thereof capable of forming multimers. 
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The sequences of a number of mammalian C4bp proteins are 
available in the art. These include human C4bp core protein 
(SEQ ID N0:1). There are a number of homologues of human C4bp 
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core protein available in the art. There are two types of 
homologue: orthologues and paralogues. Orthologues are defined 
as homologous genes in different organisms, i.e. the genes 
share a common ancestor coincident with the speciation event 
5 that generated them. Paralogues are defined as homologous 

genes in the same organism derived from a gene, chromosome or 
genome duplication, i.e. the common ancestor of the genes 
occurred since the last speciation event. 

10 For example, a search of GenBank indicates mammalian C4bp core 
homologue proteins in species including rabbit, rat, mouse and 
bovine origin (SEQ ID NO: 2-5 respectively). Paralogues have 
been identified in pig (ApoR) , guinea pig (AM67) and mouse 
(ZP3); shown as SEQ ID NO: 6-8 respectively. 

15 

An alignment of SEQ ID N0s:l-8 is shown as Figure 1. It can 
be seen that all eight sequences have a high degree of 
similarity, though with a greater degree of variation at the 
C-terminal end. Further C4bp core proteins may be identified 
>0 by searching databases of DNA or protein sequences, using 
commonly available search programs such as BLAST. 

Where a C4bp protein from a desired mammalian source is not 
available in a database, it may be obtained using routine 

15 cloning methodology well established in the art. In essence, 
such techniques comprise using nucleic acid encoding one of 
the available C4bp core proteins as a probe to recover and to 
determine the sequence of the C4bp core proteins from other 
species of interest. A wide variety of techniques are 

JO available for this, for example PGR amplification and cloning 
of the gene using a suitable source of mRNA (e.g. from an 
embryo or an actively dividing differentiated or tumour cell), 
or by methods comprising obtaining a cDNA library from the 
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mammal, e.g. a cDNA library from one of the above-mentioned 
sources, probing said library with a known C4bp nucleic acid 
under conditions of medium to high stringency (for example 
0.03M sodium chloride and 0.03M sodium citrate at from about 
5 50°C to about 50°C), and recovering a cDNA encoding all or 

part of the C4bp protein of that mammal. Where a partial cDNA 
is obtained, the full length coding sequence may be determined 
by primer extension techniques. 

10 A fragment of a C4bp core protein capable of forming multimers 
may comprise at least 47 amino acids, preferably at least 50 
amino acids. The ability of the fragment to form multimers 
may be tested by expressing the fragment in a prokaryotic host 
cell according to the invention, and recovering the C4bp 

15 fragment under conditions which result in multimerisation of 
the full 57 amino acid C4bp core, and determining whether the 
fragment also forms multimers. Desirably a fragment of C4bp 
core comprises at least residues 6-52 of SEQ ID N0:1 or the 
corresponding residues of its homologues. 

10 

The human C4bp core protein of SEQ ID N0:1 corresponds to 
amino acids +493 to +549 of full length C4bp protein sequence. 
A fragment of this known in the art to form multimers 
corresponds to amino acids +4 98 to +54 9 of C4bp core protein. 

15 

Variants of C4bp core and fragments capable of forming 
multimers, which variants likewise retain the ability to form 
multimers (which may be determined as described above for 
fragments) may also be used. The variant will preferably have 
30 at least 70%, more preferably at least 80%, even more 

preferably at least 90%, for example at least 95% or most 
preferably at least 98% sequence identity to a wild type 
mammalian C4bp core or a multimer-f orming fragment thereof. 
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In one aspect, the C4bp core will be a core which includes the 
two cysteine residues which appear at positions 6 and 18 of 
SEQ ID Nos:l-3 and 5-8. Desirably, the variant will retain 
the relative spacing between these two residues. 

5 

The above-specified degree of identity will be to any one of 
SEQ ID N0s:l-8 or a multimer-f orming fragment thereof. 

Most preferably the specified degree of identity will be to 
10 SEQ ID N0:1 or a multimer-f orming fragment thereof. 

The degree of sequence identity may be determined by the 
algorithm GAP, part of the "Wisconsin package" of algorithms 
widely used in the art and available from Accelrys (formerly 

15 Genetics Computer Group, Madison, WI) . GAP uses the Needleman 
and Wunsch algorithm to align two complete sequences in a way 
that maximises the number of matches and minimises the number 
of gaps. GAP is useful for alignment of short closely related 
sequences of similar length, and thus is suitable for 

20 determining if a sequence meets the identity levels mentioned 
above. GAP may be used with default parameters. 

Synthetic variants of a mammalian C4bp core protein include 
those with one or more amino acid substitutions, deletions or 

25 insertions or additions to the C- or N-termini. Substitutions 
are particularly envisaged. Substitutions include 
conservative substitutions. Examples of conservative 
substitutions include those set out in the following table, 
where amino acids on the same block in the second column and 

30 preferably in the same line in the third column may be 
substituted for each other: 
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Non-polar 
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Polar - uncharged 


C S T M 


N Q 


Polar - charged 


D E 


K R 


AROMATIC 




H F W Y 


OTHER 
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Examples of fragments and variants of the C4bp core protein 
which may be made and tested for their ability to form 
multimers thus include SEQ ID NOs: 9 to 16, shown in Table 1 
5 below: 



A 


B 


C 


9 


CEQVLTGKRLMQCLPNPEDVKMALEVYKLSLEIEQLELQRDSARQSTLDKEL 


100 


10 


ETPEGCEQVLTGKRLMQCLPNPEDVKMALEVYKLSLEIKQLELQRDSARQSTLDKEL 


98 


11 


CEQVLTGKRLMQCLPNPEDVKMALEVYKLSLEIKQLELQRDSARQSTLDKEL 


98 


12 


ETPEGCEQVLTGKRLMQCLPNPEDVKMALEIYKLSLEIEQLELQRDSARQSTLDKEL 


98 


13 


ETPEGCEQVLTGKRLMQCLPNPEDVKMALEIYKLSLEIKQLELQRDSARQSTLDKEL 


96.5 


14 


EGCEQALTGKRLMQCLPNPEDVKMALEIYKLSLEIKQLELQRDSARQSTL 


94 


15 


ETPEGSEQVLTGKRLMQSLPNPEDVKMALEVYKLSLEIKQLELQRDSARQSTLDKEL 


94 


16 


EGSEQALTGKRLMQSLPNPEDVKMALEIYKLSLEIEQLELQRDSARQSTLDK — 


92.3 



A=SEQ ID NO:; B= sequence, C= % identity, calculated by 
reference to a fragment of SEQ ID N0:1 of the same length. 

0 

Where deletions of the sequence are made, apart from N- or C- 
terminal truncations, these will preferably be limited to no 
more than one, two or three deletions which may be contiguous 
or non-contiguous. 

15 

Where insertions are made, or N- or C-terminal extensions to 
the core protein sequence, these will also be desirably 
limited in number so that the size of the core protein does 
not exceed the length of the wild type sequence by more than 
!0 20, preferably by more than 15, more preferably no more than 
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10, amino acids. Thus in the case of SEQ ID N0:1, the core 
protein, when modified by insertion or elongation, will 
desirably be no more than 77 amino acids in length. 

5 N- or C-terminal extensions may include flexible linkers such 
as (Gly-Gly-Gly-Gly-Ser) n (where n is from 1 to 4) used in the 
art to attach protein domains (particularly antibody V 
domains) to each other. 

10 When the fusion proteins of the invention are made by chemical 
synthesis, N- or C-terminal extensions may include analogues 
of amino acids not naturally present in proteins which can be 
used in the art of peptide and polypeptide synthesis. 

15 Recowbxnant protein. 

The recombinant protein of the invention will comprise a C4bp 
core (or ^^scaffold") as described above either alone or linked 
in-frame to at least one sequence of biological interest. 
Such a sequence may comprise a tag useful for identification 
10 or purification of the protein, and/or a protein useful in 
therapy, particularly human therapy. 

The recombinant protein can be described as having a general 
structure of the formula: BiN-Co-Bic in which Co is the core 
25 protein as described above, and Bin is either the amino 

terminus of the core protein or at least one sequence (for 
example one or two) of biological interest, and Bic is either 
the C-terminus of the core protein or at least one sequence 
(for example one or two) of biological interest. 

30 

Preferably, one of BIn and Bic is not a sequence of biological 
interest (i.e. one or other is a terminal of the fusion or 
optionally a tag, such as a polyhistidine tag, to aid recovery 
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of the protein) . More preferably, the biological sequence of 
interest is represented by BIn. 

Alternatively, a protein or non-protein product of interest 
5 may be coupled by synthetic means to a side-chain of the core, 
e.g. through the amino group of the side-chain of a lysine 
residue or through cysteine residues added within, or at the 
end of, the core sequence; or to the existing cysteine 
residues. 

10 

It is preferred that the biological sequence of interest is 
not all or part of a C4 binding protein normally linked to the 
C4bp core protein, i.e., the biological sequence of interest 
is a heterologous sequence. 

15 

We have found that proteins falling within the above 
definition can be expressed in and recovered from bacterial 
expression systems in multimeric form without the need for 
scaffold refolding. We have expressed proteins which have a 
10 monomer weight up to about 30 kDa. The invention may thus be 
used to express proteins in this size range, and more 
generally for proteins up to about 100 kDa, more preferably 
about 50 kDa. 

15 A particular class of fusion proteins will be those in which 

the C4bp core is fused to a peptide of from 2 to 25 amino acid 
residues. Many biologically active peptides are known or can 
be selected through phage display. However, they are often 
unstable in vivo, not least because they can be filtered 

10 through the renal glomerulus. Fusing them to the core scaffold 
makes filtration impossible. In addition, it confers avidity 
on the oligomerised peptides (such that they bind their 
targets more tightly and are effective at lower doses, and can 
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cross-link receptors). Particular biologically active 
peptides of interest include naturally occurring peptide or 
polypeptide hormones, such as somatostatin, calcitonin and 
alpha-MSH (melanocyte stimulating hormone) and variants 
5 thereof as well as other mentioned elsewhere herein. 

Thus a range of fusion proteins of C4bp core protein may 
synthesized using the method of the present invention. The 
multimeric fusion proteins produced will be expected to 
10 exhibit increased bioactivity because multimers will have a 

higher density of the moiety attached to the C4bp core protein 
and would thus be expected to have a longer half life and an 
decreased turnover rate. 

15 The sequence (s) of biological interest may be a polypeptide or 
a chemical compound (e.g. a drug or pro-drug) or a 
carbohydrate which is heterologous to the C4bp core protein 
used in the invention. In other words, it is not part of the 
same molecule in nature. It may be derived from the same 

10 organism. When the attached moiety is a chemical compound, 
the attachment may serve to protect the compound from 
metabolism and excretion, for example by hepatic cytochromes, 
as well as serving to deliver it to tissues. Examples of 
polypeptides include those used for medical or bio- 

25 technological use, such as insulin, cytokines including 

interleukins and interferons, antibodies and their fragments, 
growth factors, receptors, receptor ligands, agonists or 
antagonists, enzymes, enzyme antagonists, antigens, toxins and 
proteases. 



Fusion proteins prepared according to the invention, and the 
novel fusion proteins of the invention described herein, may 
be prepared in the form of a pharmaceutical composition which 
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comprises the protein together with one or more 
pharmaceutically acceptable carriers or diluents. The 
composition will be prepared according to the intended use and 
route of administration of the fusion protein. 

6 

Pharmaceutically acceptable carriers or diluents include those 
used in formulations suitable for oral, rectal, nasal, topical 
(including buccal and sublingual), vaginal or parenteral 
(including subcutaneous, intramuscular, intravenous, 
10 intradermal, intrathecal and epidural) administration. The 

formulations may conveniently be presented in unit dosage form 
and may be prepared by any of the methods well known in the 
art of pharmacy. 

For solid compositions, conventional non-toxic solid carriers 

15 include, for example, pharmaceutical grades of mannitol, 

lactose, cellulose, cellulose derivatives, starch, magnesium 
stearate, sodium saccharin, talcum, glucose, sucrose, 
magnesium carbonate, and the like may be used. The active 
compound as defined above may be formulated as suppositories 

10 using, for example, polyalkylene glycols, acetylated 
triglycerides and the like, as the carrier. Liquid 
pharmaceutically administrable compositions can, for example, 
be prepared by dissolving, dispersing, etc, a fusion protein 
of the invention optional pharmaceutical adjuvants in a 

15 carrier, such as, for example, water, saline aqueous dextrose, 
glycerol, ethanol, and the like, to thereby form a solution or 
suspension. If desired, the composition to be administered 
may also auxiliary substances such as pH buffering agents and 
the like. Actual methods of preparing such dosage forms are 

30 known, or will be apparent, to those skilled in this art; for 
example, see Remington's Pharmaceutical Sciences, Mack 
Publishing Company, Easton, Pennsylvania, 19th Edition, 1995. 
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The composition or formulation to be administered will, in any 
event, contain a quantity of the active compound (s) in an 
amount effective to alleviate the symptoms of the subject 
being treated. Dosage forms or compositions containing active 
5 ingredient in the range of 0.25 to 95% with the balance made 
up from non-toxic carrier may be prepared. 

Parenteral administration is generally characterized by 
injection, either subcutaneously, intramuscularly or 

0 intravenously. Injectables can be prepared in conventional 
forms, either as liquid solutions or suspensions, solid forms 
suitable for solution or suspension in liquid prior to 
injection, or as emulsions. Suitable excipients are, for 
example, water, saline, dextrose, glycerol, ethanol or the 

5 like. A more recently devised approach for parenteral 

administration employs the implantation of a slow-release or 
sustained-release system, such that a constant level of dosage 
is maintained. See, e.g., US Patent No. 3,710,795. 

;0 The following classes of polypeptides are preferred, but the 
invention is not limited thereto: 

Cytokines 

Interleukins include any known interleukin including IL-1, IL- 
[5 2, IL-3, IL-3, IL-4, IL-5, IL-6, IL-7, IL-8, IL-9, IL-10, IL- 
11 and IL-12. Interleukins are modulators of the immune 
system. Some interleukins are involved in the inflammatory 
response or in the immune response to disease. 

JO Interferons include any form of IFN-alpha, as well as IFN-beta 
and IFN-gamma. These also have use in modulation of the 
immune response. 
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A further class of cytokines are the tumour necrosis factors 
TNF-alpha and TNF-beta. 

Other cytokines include members of the MIP family including 
5 MlP-la, MIP-1(5 and RANTES. RANTES binds the CCR5 HIV co- 
receptor and therapy with RANTES may be effective in 
alleviating the progression of HIV infection. 

Antibodies 

10 The affinity of antibodies or antibody fragments for antigens 
may be increased by oligomerisation when the antibodies are 
produced as C4bp core fusion proteins according to the method 
of the present invention. Antibody fragments may be fragments 
such as Fv, Fab and F(ab')2 fragments or any derivatives 

15 thereof, such as a single chain Fv fragments. The antibodies 
or antibody fragments may be non-recombinant, recombinant or 
humanised. The antibody may be of any immunoglobulin isotype, 
e.g., IgG, IgM, and so forth. 

10 In another aspect, the antibody fragments may be camelised Vh 
domains. It is known that the main intermolecular 
interactions between antibodies and their cognate antigens are 
mediated through Vh CDR3. However, Vn-only antibodies, such as 
those derived from camel or llama (naturally Vn-only single 

25 chain antibodies), have only low affinity for cognate antigen. 

The method of the present invention makes it possible to 
obtain improved yields of oligomers of C4bp core proteins with 
Vh domains, or Vh CDR3 domains which are high-affinity 
30 antibodies. Two or more domains may be included to the C4bp 
core oligomer made according to the method of the present 
invention; up to 8 domains may be included, forming an 
octameric antibody molecule. 



I 
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Antibody targets may include tumour-associated antigens, 
including CEA and erbB, which are found in many colon and 
breast tumours respectively. 

5 

In one embodiment, the biological protein of interest may 
comprise the antibody fused to an enzyme capable of converting 
a prodrug into a drug toxic to the tumour cell. This can be 
used in a method of antibody-directed enzyme-prodrug therapy 
10 (ADEPT) . Alternatively, monomers of carrying a tumour 

directed antibody and monomers carrying such an enzyme (e.g. a 
carboxypeptidase, a nitroreductase or the like) may be co- 
expressed in a cell or expressed in separate cells and mixed 
together to form heteromultimers directed to a tumour cell. 

15 

Antibodies may also be targeted to antigens of pathogenic 
organisms, including those mentioned below in the context of 
antigens for use as immunogens. 

>0 Growth factors 

Growth factors include hormones such as growth hormone (in 
particular human growth hormone, hGH, as well as monocyte 
colony stimulating factor (M-CSF) , granulocyte colony 
stimulating factor (G-CSF) , granulocyte macrophage colony 

15 stimulating factor (GM-CSF) , erythropoietin and platelet 

derived growth factor (PDGF) . Active fragments of such growth 
factors may also be used. Mammalian, particularly human, 
growth factors are particularly preferred. 

30 Receptors 

Receptors may be useful therapeutically in binding to proteins 
in the human body which are expressed at aberrant or unwanted 
levels . 
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For example, over-expression of TNF-alpha is associated with 
rheumatoid arthritis, and anti-TNF therapy has been successful 
in treatment of this condition. The biological protein of 
5 interest may thus be a TNF-alpha receptor, 

h receptor of interest is also another member of the TNF 
receptor family, known as the BAFF receptor (Thompson at ai. 
Science, 2001, 293, 2108). The human BAFF receptor (Genbank 
10 Accession no- AF373846) is a 184 amino acid protein which 

binds the TNF-related ligand BAFF. Over-expression of this 
ligand in mice can cause a systemic lupus erythematosis (SLE)- 
like symptom, and thus the BAFF receptor is of interest as a 
possible therapeutic of this disease. 

15 

In one aspect, the invention provides a fusion protein of the 
C4bp core and a BAFF receptor, including fragment of the 
extracellular domain thereof capable of binding a BAFF ligand. 
Such a fragment may correspond to amino acids 2-51 of BAFF. 

>0 

Cell surface receptors are also of interest. For example, CD4 
receptor is a target for the HIV surface protein gpl20/160, 
and it has been widely proposed in the art to use CD4, or a 
soluble fragment thereof, as a therapeutic for HIV infection 
15 such that the CD4 blocks the ability of circulating HIV to 
enter CD4+ T-cells. 

Other cell surface receptors are also associated with viral 
infection, for example CD46 with measles virus (Christiansen 
30 et ai, iJbid) , and such cell surface receptor proteins may also 
be used in the present invention. 




21 



Receptor ligandSr agonists or antagonists 
Many cell surface receptors are activated by dimerisation - 
Well known examples are those for insulin and erythropoietin. 
The function of the ligand is to bind simultaneously to two 

5 receptors, thus dimerising and activating them. In the 

examples cited, receptor autophosphorylation occurs. This 
activates the receptor, which has a tyrosine kinase domain in 
its intracellular portion. The kinase is inactive when the 
receptor is monomeric, but is activated on dimerisation. This 

10 triggers a cascade of intracellular events, collectively 
referred to as signal transduction. 

Whilst some ligands, such as substance P, are short 
polypeptides, others (including insulin (51 amino acids) as 

15 well as kinase and phosphatase substrates) are complex 

molecules which possess binding loops projecting from the 
surface thereof. Smaller molecules which can mimic the 
natural ligands for receptors are useful for research purposes 
(for example to understand the specificity of ligand receptor 

>0 binding) . 

Short peptides or loops may be incorporated into fusion 
proteins according to the present invention to form a 
polyvalent receptor ligand or kinase / phosphatase substrate, 
25 useful for activating or inhibiting receptors and/or kinases 
at very low concentrations. 

Variation may be introduced into the heterologous polypeptides 
inserted onto the scaffold in order to map the specificity of 
50 receptors or kinases/phosphatases for their ligands or 

substrates. Variants may be produced of the same loop, or a 
set of standard different loops may be devised, in order to 
assess rapidly the specificity of a novel kinase/phosphatase. 
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Variants may be produced by randomisation of sequences 
according to known techniques, such as PGR. They may be 
subjected to selection by a screening protocol, such as phage 
display, before incorporation into protein scaffolds in 
5 accordance with the invention. 

Agonists include peptides, including peptide mimetics, which 
bind to a receptor so as to trigger the action of the receptor 
in even in the absence of the natural ligand for that 

0 receptor. An example of an agonist is the thrombopoeitin 
agonist peptide. This linear 14-mer peptide is found to be 
4,000~fold more active when dimeric than when monomeric (Dower 
W.J. et al. Stem Cells (1998) 16, Suppl 2, 21 Peptide agonists 
of the thrombopoietin receptor) . Fusion of this sequence, 

5 lEGPTLRQWLAAE^, to the core domain of C4bp, as described below 
for other peptides should create a very potent thrombopoietin 
agonist, useful for promoting platelet production and/or 
maturation. 

:0 In a further aspect, the invention provides a recombinant 
protein comprising a C4bp core protein and a thrombopoeitin 
agonist peptide, and the use of such a protein in a method of 
therapy for promoting platelet production and/or maturation in 
a human subject. The method comprises administering to a 

:6 subject in need of treatment an effective amount of the 
protein. 

A further example of an agonist is the somatostatin peptide. 
This cyclic peptide is known to bind to a number of G-protein 
\0 coupled receptors, and to inhibit the release of somatotropin. 
An analogue is marketed as Sandostatin (Novartis) for a number 
of medical indications, including the treatment of side 
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effects associated with malignant carcinoid tumours and the 
treatment of diarrhea caused by gastrointestinal infections. 

Fusion of the somatostatin sequence to filamentous 
5 bacteriophage^ as described in the British Journal of 

Pharmacology (1998) "Somatostatin displayed on filamentous 
phage as a receptor-specific agonist". Volume 125^ pages 5-16, 
produces a hybrid phage capable of binding to and activating 
somatostatin receptors. Fusion of somatostatin to the C4bp 
10 scaffold (with the scaffold replacing the phage) similarly 

produces an avid agonist for somatostatin receptors, which has 
more desirable properties as a medicament than hybrid phage. 
Similarly, the oligomeric agonist so produced is capable of 
oligomerising the somatostatin receptors, which may enhance 
15 signalling, as described by Patel et al. in Proc. Nat I.Acad. 
Sci. USA (2002) Volume 99, pages 3294-3299. 

Thus the invention provides a means to prepare a recombinant 
fusion protein as set out above wherein said fusion protein 

>0 comprises somatostatin. The invention further provides a 
fusion protein of a C-terminal core protein of C4bp alpha 
chain linked to somatostatin. The invention further provides 
the use of this protein or nucleic acid vectors (as further 
defined and described herein) encoding this protein in a 

15 method of treatment, including the treatment of side effects 
associated with malignant carcinoid tumours and the treatment 
of diarrhea caused by gastrointestinal infections. 

Antagonists include peptides which bind to receptors and block 
30 the natural ligand from binding. 
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En zymes 

Numerous biological reactions involve the sequential, and/or 
synergistic, action of a plurality of protein activities. 
Such protein activities may be incorporated into a single 

6 molecule in accordance with the present invention. 

Preferably/ therefore, the monomers which are used to compose 
the oligomer according to the invention incorporate amino acid 
sequences which encode distinct biological activities. The 
activities are advantageously complementary, such that they 

to are required sequentially in a biological reaction, or act 
synergistically . The invention therefore provides 
plurifunctional macromolecular structures comprising one or 
more enzymes. 

15 . Examples of enzymes include bacterial enzymes such as DsbA of 
E. coli. 

Antigens 

A particular use for multimers of produced in accordance with 
20 the invention is in the production of immunogens (this term is 
used interchangeably herein with ^^antigens") . A major 
application of this C4bp core fusion protein scaffold 
technology produced following the method of the present 
invention is the use of the assembled or multimerised peptides 
25 or polypeptides as antigens- The oligomerisation improves 
both detection of antibodies against, and the induction of 
antibodies to, such antigens- Some of these antigens may be 
of prophylactic value; they might be useful for vaccination. 
The method allows rapid progress from nucleotide sequences to 
30 the production of recombinant antigens in a polyvalent form. 
Predicted open reading frames (ORFs) can be used to design 
oligonucleotide sequences encoding the predicted protein 
sequence. Cloning of these oligonucleotides into the vectors 
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encoding the C4bp core protein allows a very rapid production 
of antigens, without, for example the need for isolating cDNAs 
and expressing them in heterologous systems such as coli. 

5 Bacterial immunogens, parasitic immunogens and viral 

immunogens are useful as polypeptide moieties to create 
multimeric or hetero-multimeric C4bp fusion proteins useful as 
vaccines. 

10 Bacterial sources of these immunogens include those 

responsible for bacterial pneumonia, Pneumocystis pneumonia^ 
meningitis, cholera, tetanus, tuberculosis and leprosy. 

Parasitic sources include malarial parasites, such as 
15 Plasmodium. 

Viral sources include poxviruses, e.g., cowpox virus and orf 
virus; herpes viruses, e.g., herpes simplex virus type 1 and 
2, B-virus, varicellazoster virus,, cytomegalovirus, and 

20 Epstein-Barr virus; adenoviruses, e.g., mastadenovirus; 
papovaviruses, e.g., papillomaviruses such as HPV16, and 
polyomaviruses such as BK and JC virus; parvoviruses, e.g., 
adeno-associated virus; reoviruses, e.g., reoviruses 1, 2 and 
3; orbiviruses, e.g., Colorado tick fever; rotaviruses, e.g., 

25 human rotaviruses; alphaviruses, e.g.. Eastern encephalitis 
virus and Venezuelan encephalitis virus; rubiviruses, e.g., 
rubella; f laviviruses, e.g., yellow fever virus. Dengue fever 
viruses, Japanese encephalitis virus. Tick-borne encephalitis 
virus and hepatitis C virus; coronaviruses, e.g., human 

30 coronaviruses; paramyxoviruses, e.g., parainfluenza 1, 2, 3 
and 4 and mumps; morbilliviruses, e.g., measles virus; 
pneumovirus, e.g., respiratory syncytial virus; 
vesiculoviruses, e.g., vesicular stomatitis virus; 
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lyssaviruseS/ e.g., rabies virus; orthomyxoviruses, e.g., 
influenza A and B; bunyaviruses e.g., LaCrosse virus; 
phleboviruses, e.g.. Rift Valley fever virus; nairoviruses, 
e.g., Congo hemorrhagic fever virus; hepadnaviridae, e.g., 

5 hepatitis B; arenaviruses, e.g., 1cm virus, Lasso virus and 
Junin virus; retroviruses, e.g., HTLV I, HTLV II, HIV-1 and 
HIV-2; enteroviruses, e.g., polio virus 1,-2 and 3, coxsackie 
viruses, echoviruses, human enteroviruses, hepatitis A virus, 
hepatitis E virus, and Norwalk- virus; rhinoviruses e.g., 

10 human rhinovirus-; and filoviridae, e.g., Marburg (disease) 
virus and Ebola virus. 

Antigens from these bacterial, viral and parasitic sources may 
be used in the production of multimeric proteins useful as 
15 vaccines. The multimers may comprise a mixture of monomers 
carrying different antigens. 

Immunogens to human proteins for research or therapeutic 
purposes may be made. Immunogenic peptides, capable of 

10 raising an immune response when exposed to the immune system 
of an organism, are preferred polypeptides for making C4bp 
core protein fusion proteins following the method of the 
invention. The improved yield of oligomerised C4bp core 
fusion proteins from the present invention has many 

>5 applications not only in vaccination but also in research. 

For example, the generation of human gene sequence data by the 
human genome project has made the generation of antisera 
reactive to new polypeptides a pressing requirement. The same 
requirement applies to prokaryotic, such as bacterial, and 

JO other eukaryotic, including fungal, gene products. Immunogens 
of interest fused to C4bp core multimers are thought to have 
increased efficiency due to their increased avidity for 
immunoglobulin molecules. 
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The present invention has many advantages in the generation of 
an inmiune response. For example, the use of oligomers can 
permit the presentation of a number of antigens, 

5 simultaneously, to the immune system. This allows the 

preparation of polyvalent vaccines, capable of raising an 
immune response to more than one epitope, which may be present 
on a single organism or a number of different organisms. Thus, 
vaccines formed according to the invention may be used for 

10 simultaneous vaccination against more than one disease, or to 
target simultaneously a plurality of epitopes on a given 
pathogen. The epitopes may be present in a single monomer 
units or on different monomer units which are combined to 
provide a heteromultimer . 

15 

Moreover, the invention may be exploited by incorporating an 
adjuvant on the C4bp core oligomer, together with the 
immunogen. Suitable adjuvants are, for example, bacterial 
toxins and cytokines, such as interleukins . The potency of 
10 the immunogen is thereby increased, allowing more efficient 

raising of antisera and more efficient immunisation. A highly 
preferred adjuvant is the C3d component of complement. 

Having 'C4bp core fusion proteins is useful in the context of 
immunisations, because the core protein is not only present 

15 normally in the serum or plasma of the recipient of the 

immunisation, but also because it does not itself evoke an 
immune response. C4bp proteins are known in a number of 
mammalian species, and the appropriate homologues for 
mammalian species may be found by those skilled in the art 

30 using standard gene cloning techniques. 
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The fact that this system allows production of soluble protein 
in E. coli enables using it to produce, as folded soluble 
proteins, domains or fragments of proteins that would not fold 
when expressed on their own due to a lack of constraint on 

5 their C-terminal and /or N-terminal ends. Engineering a 

specific cleavage site enables production of the free domain 
of interest. Similarly constraining the N-terminal and/or C- 
terminal end of a peptide of interest could be beneficial 
during refolding processes. Furthermore, as the 

10 oligomerisation structure is very resistant to denaturation 

and to disassembly, it would be stable during denaturation of 
the inserted protein. Therefore, during refolding, for an 
equal amount of protein of interest, the actual concentration 
of free protein would be diminished by a factor equal to the 

15 oligomerisation number. Oligomerisation may also be 

beneficial for purification purposes as many methods in 
protein technology are not optimised to work with proteins and 
specifically peptides of very low molecular weight. 

>0 Assay methods 

The C4bp core fusion proteins produced following the method of 
the invention may be applied to the detection or the 
neutralisation of antibodies in vivo or in vitro. For 
example, in vitro polyvalent or monovalent antigen-bearing 

>5 C4bp core fusion proteins may be used to select antibody 

molecules derived from phage display experiments. Moreover, 
in vivor antigen-bearing C4bp core fusion proteins produced 
according to the method of invention may be used to neutralise 
antoantibodies in autoimmune disease, or to detect antibodies 

30 which may be indicative of pathological conditions, such as in 
HIV testing or other diagnostic applications. 
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Phage Display 

Phage display technology has proved to be enormously useful in 
biological research. It enables ligands to be selected from 
large libraries of molecules. The proteins of the present 

5 invention also harnesses the power of this technique, but with 
some powerful advantages over normal applications. C4bp 
molecules can be displayed as monomers on fd bacteriophages, 
just as single-chain Fv molecules are. Libraries of fusions 
are constructed by standard methods, and the resulting 

0 libraries screened for ligands of interest. It is important 
to note that this is an affinity based selection. After 
characterisation, the ligands selected for affinity, can be 
oligomerised, and thus take advantage of avidity. When the 
target for the ligand is oligomeric, very tight binding will 

5 result. Furthermore, ligands selected as monomers, will be 
able to cross-link or oligomerise their binding partners. An 
application of this effect is in triggering receptor 
activation. 

!0 Protein chips 

Currently, DNA microarrays, whether of oligonucleotides, PGR 
products or cloned DNAs, are major tools enabling rapid 
development in the highly parallel analysis of gene 
expression. Clearly, in many situations, it would be far 

i5 preferable to monitor gene expression directly, that is, by 
assaying protein expression levels rather than mRNA levels. 
. The latter are but an indirect measure of gene activity which 
rely on the hybridisation of labelled cDNA and can be very 
misleading because there is often a poor correlation between 

)0 the abundance of a particular mRNA and the frequency at which 
it is translated into proteins. In addition, mRNA analysis 
can not possibly determine whether the encoded protein, even 
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if translated, is active. This may depend on post- 
translational modification. 

Thus protein arrays comprising fusion proteins of a core 
5 scaffold and a range of ligands for proteins of interest may 
be produced and used to determine levels of expression of 
those proteins in a sample. 

For example, an array of bacterial cells expressing the 
0 scaf f old-ligand fusions may be provided, such that the fusions 
are expressed and recovered in situ, followed by addition of 
the sample. Alternatively, the fusions may be produced 
separately and then arrayed on a suitable solid support to 
provide for detection of the proteins in the sample. 
5 Detection may be by providing a predetermined amount of the 
proteins of interest labelled to compete against the proteins 
present in the sample, and measuring how much labelled protein 
binds to the ligand. Alternatively, the ligand may be 
labelled and the amount of labelled ligand bound to the 
!0 protein of interest detected. 

Nucleic Acids 

Proteins comprising the C4bp core are produced by expression 
of the protein in a prokaryotic host cell, using a nucleic 
!5 acid construct encoding the recombinant protein. 

The construct will generally be in the form of a replicable 
vector, in which sequence encoding the protein is operably 
linked to a promoter suitable for expression of the protein in 
JO a desired host cell. The promoter may be an inducible 

promoter. Suitable promoters include the T7 promoter, the tac 
promoter, the trp promoter,' the lambda promoters Pl or Pr and 
others well known to those skilled in the art. 
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The vectors may be provided with an origin of replication and 
optionally a regulator of the promoter. The vectors may 
contain one or more selectable marker genes ^ for example an 

5 antibiotic resistance gene such as an ampicillin, tetracycline 
or preferably kanamycin resistance gene. There are a wide 
variety of bacterial expression vectors known as such in the 
art, and the present invention may utilise any vector 
according to the individual preferences of those of skill in 

10 the art. 

A wide variety of prokaryotic host cells can be used in the 
method of the present invention. These hosts may include 
strains of Escherichia, Pseudomonas, Bacillus, Lactobacillus; 

15 ThermophiluS; Salmonella, Enterobacteriacae or Streptomyces. 

For example, if E. coli from the genera Escherichia is used in 
the method of the invention, preferred strains of this 
bacterium to use would include BL21(DE3) and their derivatives 
including C41(DE3); C43 (DE3) or C0214 (DE3) , or other strains 

10 resistant to the toxicity of recombinant protein expression as 
described and made available in WO98/02559. 

Even more preferably, derivatives of these strains lacking the 
prophage DE3 may be used when the promoter is not the T7 
>5 promoter. 

DNA vacclrxBs and therapeutics 

In another aspect, the invention provides a eukaryotic 
expression vector comprising a nucleic acid sequence encoding 
30 a recombinant fusion protein comprising a scaffold of a C- 

terminal core protein of C4bp alpha chain for the use in the 
treatment of the human or animal body. 
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Such treatment would achieve its therapeutic effect by 
introduction of a specific nucleic acid sequence into cells or 
tissues affected by a genetic or other disease, or by 
introduction of a nucleic acid sequence encoding an antigen 

5 for the purposes of raising an immune response. It is also 

possible to introduce genetic sequences into a different cell 
or tissue than that affected by the disease, with the aim that 
the gene product will have direct or indirect impact on the 
diseases cells or tissues. Delivery of nucleic acids can be 

10 achieved using a plasmid vector (in "naked" or formulated 
form) or a recombinant expression vector. 

Various viral vectors which can be utilized for gene therapy 
include adenovirus, herpes virus, vaccinia or an RNA virus 

15 such as a retrovirus. The retroviral vector may be a 

derivative of a murine or avian retrovirus. Examples of 
retroviral vectors in which a single foreign gene can be 
inserted include, but are not limited to: Moloney murine 
leukaemia virus (MoMuLV) , Harvey murine sarcoma virus 

>0 (HaMuSV), murine mammary tumour virus (MuMTV) , and Rous 

Sarcoma Virus (RSV) . When the subject is a human, a vector 
such as the gibbon ape leukaemia virus (GaLV) can be utilized. 

The vector will include a transcriptional regulatory sequence, 
15 particularly a promoter region sufficient to direct the 

initiation of RNA synthesis* Suitable eukaryotic promoters 
include the promoter of the mouse metallothionein I gene 
(Hamer et al. 1982 J, Molec. Appl. Genet. 1, 273); the TK 
promoter of Herpes virus (McKnight^ 1982 Cell 31,355); the 
10 SV40 early promoter (Benoist et ai. 1981 Nature 290, 304); the 
Rous sarcoma virus promoter (Gorman et al, 1982 Proc. Natl 
Acad. Sci. USA 79, 6777); and the cytomegalovirus promoter 
(Foecking et ai. 1980 Gene 45, 101). 
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Promoters specific for the cell type requiring the gene 
therapy are desirable in many instances. In a situation where 
a particular cell type is used as a platform to produce 

5 therapeutic proteins destined for another site (for either 
direct or indirect action) , then the chosen promoter should 
work well in the "factory" site. Muscle is a good example for 
this, as it is post-mitotic, it could produce therapeutic 
proteins for years on end as long as there is no immune 

0 response against the protein-expressing muscle fibres. 

Therefore, use of strong muscle promoters as are particularly 
applicable here. Except for treating a muscle disease per se, 
use of muscle is typically only suitable where there is a 
secreted protein so that it can circulate and function 

15 elsewhere (e.g., hormones, growth factors, clotting factors). 

Administration of vectors of this aspect of the invention to a 
subject, either as a plasmid vector or as part of a viral 
vector can be affected by many different routes. Plasmid DNA 

!0 can be "naked" or formulated with cationic and neutral lipids 
(liposomes) or microencapsulated for either direct or indirect 
delivery. The DNA sequences can also be contained within a 
viral (e.g., adenoviral, retroviral, herpesvius, pox virus) 
vector, which can be used for either direct or indirect 

i5 delivery. Delivery routes include but are not limited to 

intramuscular, intradermal (Sato, Y. et al. 1996 Science 273, 
352), intravenous, intra-arterial, intrathecal, intrahepatic, 
inhalation, intravaginal instillation (Bagarazzi et al. 1997 J 
Med. Primatol. 26, 27), intrarectal, intratumour or 

50 intraperitoneal . 

Thus the invention includes a vector as described herein as a 
pharmaceutical composition useful for allowing transfection of 
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some cells with the DNA vector such that a therapeutic 
polypeptide will be expressed and have a therapeutic effect 
(to ameliorate symptoms attributable to infection or disease) . 
The pharmaceutical compositions according to the invention are 

5 prepared by bringing the construct according to the present 

invention into a form suitable for administration to a subject 
using solvents, carriers, delivery systems, excipients, and 
additives or auxiliaries. Frequently used solvents include 
sterile water and saline (buffered or not) . One carrier 

10 includes gold particles, which are delivered biolistically 

(i.e., under gas pressure). Other frequently used carriers or 
delivery systems include cat ionic liposomes, cochleates and 
microcapsules, which may be given as a liquid solution, 
enclosed within a delivery capsule or incorporated into food. 

15 

An alternative formulation for the administration of gene 
therapy vectors involves liposomes. Liposome encapsulation 
provides an alternative formulation for the administration of 
polynucleotides and expression vectors. Liposomes are 

10 microscopic vesicles that consist of one or more lipid 

bilayers surrounding aqueous compartments. See, generally, 
Bakker-Woudenberg et al. 1993 Eur. J. Clin. Microbiol. Infect. 
Dis. 12,Suppl. 1,S61, and Kim, 1993 Drugs 46, 618. Liposomes 
are similar in composition to cellular membranes and as a 

25 result, liposomes can be administered safely and are 
biodegradable. Depending on the method of preparation, 
liposomes may be unilamellar or multilamellar, and liposomes 
can vary in size with diameters ranging from 0.02 ]iM to 
greater than 10 pM. See, for example, Machy et al. 1987 

30 LIPOSOMES IN CELL BIOLOGY AND PHARMACOLOGY (John Libbey) , and 
Ostro et ai. 1989 American J. Hosp. Phann. 46, 1576. 

Expression vectors can be encapsulated within liposomes using 
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standard techniques. A variety of different liposome 
compositions and methods for synthesis are known to those of 
skill in the art. See, for example, U.S. Pat. No. 4,844,904, 
U.S. Pat. No. 5,000,959, U.S. Pat. No. 4,863,740, U.S. Pat. 
5 No. 5,589,466, U.S. Pat. No. 5,580,859, and U.S. Pat. No. 

4,975,282, all of which are hereby incorporated by reference. 

In general, the dosage of administered liposome-encapsulated 
vectors will vary depending upon such factors as the patient's 
iO age, weight, height, sex, general medical condition and 
previous medical history. Dose ranges for particular 
formulations can be determined by using a suitable animal 
model. 

15 In one embodiment, the vector encodes a fusion protein 

comprising the core and, in addition, one or more antigens and 
optionally and preferably a protein with immunostimulatory 
properties. C3d is known to have strong immunostimulatory 
properties and may be used for this purpose, as may be an 

10 interleukin, particularly IL-2 or IL-12. 

Cell culturxng 

Plasmids encoding fusion proteins in accordance with the 
invention may be introduced into the host cells using 

25 conventional transformation techniques, and the cells cultured 
under conditions to facilitate the production of the fusion 
protein. Where an inducible promoter is used, the cells may 
initially be cultured in the absence of the inducer, which may 
then be added once the cells are growing at a higher density 

30 in order to maximise recovery of protein. 

Cell culture conditions are widely known in the art and may be 
used in accordance with procedures known as such. 
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Recovery of protein from culture 

Once the cells have been grown to allow for production of the 
protein, the protein may be recovered from the cells. Because 
5 we have found that surprisingly, the protein remains soluble, 
the cells will usually be spun down and lysed by sonication 
which keeps the protein fraction soluble and allows this 
fraction to remain in the supernatant following a further 
higher-speed (e.g. 15,000 rpm for 1 hour) centrif ugation. 

10 

The fusion protein in the supernatant protein fraction may be 
purified further by any suitable combination of standard 
protein chromatography techniques. We have used ion-exchange 
chromatography followed by gel filtration chromatography. 
15 Other chromatographic techniques, such as affinity 
chromatography, may also be used. 

In one embodiment, we have found that heating the supernatant 
sample either after centrif ugation of the lysate, or after any 
10 of the other purification steps will assist recovery of the 
protein. The sample may be heated to about 70 - 80 °C for a 
period of about 10 to 30 minutes. 

Depending on the intended uses of the protein, the protein may 
>5 be subjected to further purification steps, for example 
dialysis, or to concentration steps, for example freeze 
drying. 

The invention is illustrated by the following examp-les. 

30 
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Example 1. Production of db-'C4bp 
Vector construct. 

An expression vector encoding the downstream box peptide 
sequence MASMNHKGS (Sprengert Fuchs E. and Porter A.G 

5 1996 ^'The downstream box: an efficient and independent 

translation initiation signal in Escherichia coli." EMBO J. 
Volume 15, 665-674) fused N-terminal to the 57 amino acid 
"core" domain of the human C4bp alpha chain was constructed. 

0 Briefly, the C4bp core domain is encoded entirely within a 
single exon in the human genome, thus allowing it to be 
amplified directly from human genomic DNA. The oligo- 
nucleotide primers used were: 

AVD102: 5'CCCGCGGATCCGAGACCCCCGAAGGCTGTGA3' ; and 

5 AVD103: 5' CCCCGGAATTCTTATTATAGTTCTTTATCCAAAGTGG3' . 

These contained added restriction sites which were used for 
cloning the amplified DNA fragment. The 183 base-pair fragment 
obtained on digesting the PGR product with the enzymes BamHI 
and EcoRI was cloned downstream of the translational enhancer 

!0 or '^downstream box" and the T7 promoter in a plasmid vector. 
The plasmid was derived from the plasmid pRsetA supplied by 
Invitrogen, but the fl origin of replication has been replaced 
by the par locus from the plasmid pSClOl. It thus contains as 
functional elements: a selectable marker (ampicillin 

!5 resistance) an origin of replication (derived from the pUC 

family) and a T7 promoter and a T7 transcription terminator as 
well as the par locus. The resulting construct was designated 
plasmid pAVD 77. Figure 8 shows the sequence of the 
translational enhancer and T7 promoter fused to the coding 

10 sequence of C4bp (in small print) . 

The predicted size of the db-C4bp fusion protein is 74 91.5 Da. 
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Transformation and expression. 

The vector was transformed into the E. coli strain C41{DE3)/ a 
derivative (Bruno Miroux and John E. Walker 1996 "Over- 
5 production of Proteins in Escherichia coli: Mutant Hosts that 
Allow Synthesis of some Membrane Proteins and Globular 
Proteins at High Levels." Journal of Molecular Biology Volume 
260, 289-298) of BL21{DE3). 

0 One litre of LB-Ampicillin medium was inoculated with the 

cells, which were incubated at 37**C with shaking for 3 hours 
(until OD600 nm reached 0.6) and then it was induced with IPTG 
(isopropylthiogalactoside) at a final concentration 0.7 mM for 
3 hours. The cells were harvested by centrif ugation at 4600 

5 rpm for 30 min. 

The pellet (P) was resuspended with 30 mis Tris 50 mM pH 7, 
and the cells were broken by sonication using an Emulsiflex 
apparatus twice (between each treatment, centrif ugation at 
!0 15000 rpm for 1 hour, the supernatants from each spin 

(designated SNl and SN2 respectively) were kept and the pellet 
PI was resuspended with the same buffer) . 

Both supernatants were pooled (60 mis) and were split into two 
15 solutions of 30 mis. Each of these 30 ml aliquots of the db- 
C4bp fusion protein was purified using one of two similar 
methods: these were identical except that a heating step in 
one method was replaced by a MonoQ ion-exchange step in the 
other. 

10 

Purification without a heating step 

The native db-C4bp was purified from 500 mis of culture by 
ion-exchange chromatography (DEAE Fast Flow 70, using a column 
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of 13cm in height, and diameter of 2.6cm), using TrisHCl 
buffer (50mM pH7) and a salt gradient (OM - IM NaCl) . The 
fusion protein eluted between 300-400 mM NaCl* Fractions of 
7.5 ml each were collected - see Figure 2. 

5 

Fractions B8 to Bll were pooled and dialyzed against TrisHCl 
20 mM pH7. Then this solution was loaded on a ion-exchange 
column (MonoQ HR 16/10), using Tris buffer (50mM pH7) and a 
salt gradient (OM - IM NaCl). Fractions of 2.5 ml were 
10 collected. The fusion protein eluted between 500-550 mM NaCl 
(Figure 3) . 

Fractions AlO to Bl were pooled and the final solution was 
then concentrated to a volume of 10 mis before being 

15 chromatographed on a gel filtration column (S75 26/60). 

Fractions of 5 ml were collected. The fusion protein was 
eluted from this column with a volume of 139 mis buffer 
(TrisHCl 100 mM pH7, 150 mM NaCl), see Figure 4. The 
calibration of the column with molecular weight standards 

>0 implies a molecular weight for this protein similar to albumin 
(67 kDa), which in Tris 50 mM + NaCl 150 mM also elutes with a 
volume of 139 mis, whereas the expected molecular weight of 
the monomer is 7.491 kDa. This indicates that the fusion 
protein is oligomeric in structure when purified from the 

25 cytosol of E. coli, without any steps being taken to refold 
it. 

Fractions AlO to Bl were pooled (312 yg/ml), and an aliquot 
was dialysed against sodium phosphate buffer, 100 mM, pH 7.4. 

30 

The protein yield per Litre of culture after purification was 
12.4 milligrams. 
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The CD spectrum was examined and showed the presence of 
significant secondary structure, consistent with a properly 
folded protein complex. 

5 Example 2. Purification of db-C4bp with a heating step 

The solution containing the other 30 ml aliquot of db-C4bp was 
heated at 76^C for 15 minutes and then centrifuged at 20,500 
rpm for 1 hour. The supernatant, containing db-C4bp, was 
purified by ion-exchange chromatography (DEAE Fast Flow 70 
0 mis), using Tris buffer {50mM pH7) and a salt gradient (OM - 
IM NaCl) . Fractions of 7.5 ml were collected. The fusion 
protein eluted between 300-400 mM NaCl (Figure 5) . 

Fractions B8 to Bll were pooled and the final solution was 
15 then concentrated to a volume of 10 mis before being 

chromatographed on a gel filtration column (S-75 26/60). 
Fractions of 5 ml were collected. The fusion protein was 
eluted from this column with a volume of 140mls buffer (Figure 
6) . The calibration of the column with molecular weight 
iO standards implies a molecular weight identical to that of the 
protein purified without heating (see above), whereas the 
expected molecular weight of the monomer is 7,4 91 kDa. This 
fusion protein is therefore also oligomeric in structure when 
purified from the cytosol of E, coli, without any steps being 
15 taken to refold it. Furthermore, it remains oligomeric 
despite being heated to 76*C for 15 minutes in a buffer 
comprising 50 mM TrisHCl pH7 (i.e. no salt was present). 

Fractions All to Bl were pooled (595.5 yg/ml) and an aliquot 
JO was dialysed against sodium phosphate (NaP) buffer 100 mM pH 
7.4. 
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Analysis using circular dichroism showed that the spectrum 
obtained with the sample which had been subjected to heating 
was equivalent to that obtained using the unheated sample. 
This demonstrated that the secondary structure elements of the 
5 protein are retained despite heating. 

The yield with the heating step was 3.5 milligrams per litre. 

The addition of a heating step can significantly simplify the 
0 purification of proteins. In the example here, heating* 
replaced one ion-exchange (MonoQ) step, and nevertheless 
resulted in a protein of at least equivalent purity. 

Example 3; Treatment of protein with denaturant 

5 To confirm further that the protein was indeed oligomeric, an 
attempt was made to denature purified protein in 6M 
guanadinium chloride and 20mM DTT (dithiothreitol ) at room 
temperature before repeating gel filtration under denaturing 
conditions . 

!0 

Briefly, a culture of SOOmls of the cells of example 1 were 
grown and induced as described above. The fusion protein was 
purified by ion-exchange chromatography, using TrisHCl buffer 
(50mM pH 7.4) and a salt gradient (OM - IM NaCl) . The fusion 
*5 protein eluted between 450-650 mM NaCl and was then 

concentrated to a volume of 10 mis. After this concentration 
step, the concentration of db-C4bp protein was 740 micrograms 
per ml. 

)0 The protein was then treated at a concentration of 740 
micrograms per ml overnight at 4'C with 6M guanidinium 
chloride and 20 mM DTT before being chromatographed on a gel 
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filtration column {S-75) • The fusion protein was eluted from 
this column with a volume of 11.4 mis buffer. Calibration of 
the column with molecular weight standards implies a molecular 
weight for this protein of approximately 60 kDa^ whereas the 
5 expected molecular weight of the monomer is 7.5 kDa. This 
fusion protein is therefore oligomeric in structure when 
purified from the cytosol of E. colif without any steps being 
taken to refold it and even when treated to denaturing 
conditions. 

0 

Repeating the denaturation step using 6M guanidine HCl for 2 
or 16 hours and heating to 75**C-80**C did result in denatured 
protein, as evidenced by CD analysis . 

5 Example 4: Cloning and recombinant expression in E. coli of 
the human C4bp core fused to a histidine tag sequence. 

To demonstrate that the translational enhancer is not 
essential for high-level expression of the core domain in 
Escherichia coli, and to facilitate the purification of the 
:0 protein, the DNA sequence encoding the downstream box was 

replaced by a sequence encoding a 6xHistidine tag by replacing 
an Ndel/BamHI restriction fragment in pAVD 77 with the 
following sequence: 

!5 CATATGCGGG GTTCTCATCA TCATCATCAT CATGGTCTGG TTCCGCGTGG ATCC 

The resulting plasmid pAVD 93, overproduces a recombinant 
protein of 8.4 6 kDa with the following amino acid sequence: 

JO MRGSHHHHHH GLVPRGSETP EGCEQVLTGK RLMQCLPNPE DVKMALEVYK 
LSLEIEQLEL QRDSARQSTL DKEL 
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The plasmid pAVD 93 was transformed into the bacterial strain 
C41(DE3) and expression of the fusion protein was induced 
using IPTG as described in above. A protein of 8.5 kDa as 
shown by SDS-PAGE analysis was present in induced cultures 3 
5 hours after induction but absent from uninduced cultures. 

Example 5: Cloning and recombinant expression in E. coli of 
the human C4bp core fused to the DsbA protein 

The fusion of the C4bp core domain to the short peptide 
10 sequences encoded by the downstream box enhancer or to the 
histidine tag does not necessarily imply that the fusion of 
the core domain to larger proteins is feasible. To determine 
this, the C4bp core was fused to the C-terminus of the DsbA 
protein, an enzyme normally found in the E. coli periplasmic 
15 space. DsbA comprises 177 amino acids, and as such, is 

substantially larger than the core domain itself (57 amino 
acids) . 

Construction of the plasmxd pAVD 78, encoding the DsbA~C4bp 
20 fusion protein 

The Ndel-BamHI DNA fragment in pAVD 77 encoding the downstream 
box enhancer was replaced by an Ndel-BamHI fragment encoding 
DsbA. The oligonucleotide primers used to obtain the fragment 
encoding DsbA were: 

25 AVD52: 5' GGGGCCCCCATATGGCGCAGTATGAAGATGGTAAACAG3' ; and 

AVDl 15: 5 ' GGGGAATTCTTAGGATCCAGAACCTTTTTTCTCGGACAGATATTTCAC3 ' . 
These primers were used to amplify the DsbA coding sequence 
(lacking a stop codon) from the genomic DNA of Escherichia 
coli. The PGR product was digested with both Ndel and BamHI 

30 restriction enzymes, and cloned into pAVD 77 in to create pAVD 
78. 



44 



The plasmid pAVD 77 was transformed into the bacterial strain 
C41(DE3) and expression of the fusion protein was induced 
using IPTG as described above. A protein of 28 kDa as shown by 
SDS-PAGE analysis was present in induced cultures 3 hours 
after induction, but absent from uninduced cultures. 
Surprisingly, this protein was present in the soluble fraction 
of the cell extract. 

Purification of the D3bA-C4bp fusion protein 

The fusion protein was purified by two ion-exchange 
chromatographic steps (first DEAE, secondly MonoQ) , using Tris 
HCl buffer (50mM pH 1.4) and a salt gradient (0 M-IM NaCl) in 
each case. The fusion protein eluted after the first (DEAE) 
ion-exchange chromatography at approximately 100 mM NaCl and 
was then purified by a more resolutive (MonoQ) ion-exchange 
chromatography. The fusion protein eluted at 350 mM NaCl from 
the MonoQ and was concentrated before being chromatographed on 
a S200 gel filtration column (10/30). The fusion protein was 
eluted from this column, in a volume of 12.54 mis of buffer. 

Calibration of the column with molecular weight standards 
implies a molecular weight for this protein of approximately 
200 kDa. The expected molecular weight of the monomer is 
28.08 kDa. This fusion protein is therefore also oligomeric 
in structure when purified from the cytosol of E, coli, 
without any steps being taken to refold it. 

To verify that the fusion protein was indeed oligomeric, 
rather than that its behaviour on gel filtration was aberrant, 
the purified protein was denatured in 6M guanidinium chloride 
and 20 mM DTT (for 2 hours 30 minutes at room temperature) and 
the gel filtration repeated under denaturing conditions, (that 
is in the presence of 6M guanidinium chloride and 20 mM DTT) . 
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Under these circumstance, the protein eluted in a volume of 
12-5 mis, consistent with a molecular weight of approximately 
220 kDa. The fusion protein is thus not denatured under these 
5 conditions: the protein is still oligomeric. 

Complete denaturation of this protein was obtained after 
treatment with guanidine HCl for 16 hours at 4°C, in contrast 
to the Example 3 above, where heating to 75-80 ^'C was required 
10 to obtain complete denaturation. 

Activity of the DsbA'C4bp fusion protein 

To test the activity, of DsbA-C4bp, an insulin assay was 
conducted. In the presence of DTT, active DsbA catalyses the 

15 reduction of insulin's disulphide bonds which enables the 
separation of the two chains, and thus provokes the 
precipitation of the free insulin B chain. A turbidimetric 
assay is thus used to detect the reduction of the disulphide 
bonds of insulin. (Holmgren A (1979) Thioredoxin catalyses the 

10 reduction of insulin disulfides by dithiothreitol and 
dihydrolipoamide . J. Biol. Chem. 254, 9627). 

The final reaction mixture contains 0.14 mM freshly prepared 
insulin, 0.1 M potassium phosphate pH 7.0, 2 mM EDTA, and 0.67 
>5 mM DTT, and 100 pg of DsbA or 100 pg of DsbA-C4bp fusion 
protein in a final volume of 1.2 ml. 

The reaction was initiated by addition of 8 yl of 0.1 M DTT 
and monitored by measuring the increase of turbidity at 650 nm 
30 every 5 minutes up to 60 minutes. Each sample was gently 

mixed 3-4 times prior to measuring the absorbance at 650 nm. 
The instrument blank of the reaction contained 0.1 M phosphate 
buffer pH 7.0, and 2 mM EDTA. 
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The results are shown in Figure 1, and demonstrate that the 
DsbA present in the DsbA-C4bp fusion protein is still active, 
and the activity is directly comparable to the activity of 
6 soluble DsbA. 

Example 6: Analysis of C4bp fusion proteins under non-reducing 
conditions 

The analysis of the db-C4bp fusion protein by polyacrylamide 
10 gel electrophoresis under denaturing but non-reducing 

conditions was conducted to determine the presence or absence 
of disulphide bonds between the monomers of the oligomer. 

Aliquots (12vil) of db-C4bp (312 pg/ml) were mixed with Laemmli 
15 buffer (Tris HCl 1.5 M pH 6.8, SDS 2%, glycerol 15%, 0.02% 
Bromophenol blue) with or without (J-mercaptoethanol . These 
samples were boiled at 90°C for 5 min and analysed by 
electrophoresis through a 18% sodium dodecyl sulfate 
polyacrylamide gel, also lacking p-mercaptoethanol . 

>0 

The result was that, in the absence of (J-mercaptoethanol, the 
db-C4bp fusion protein migrated as an oligomer (in the top of 
the gel. Figure 9 right side) showing that disulphide bonds 
exist between the monomers. In contrast, the addition of (5- 
>5 mercaptoethanol resulted in the migration of the db-C4bp 

protein at its monomer molecular weight, as shown in previous 
figures and on the left of Figure 9 (showing reduced samples 
of db-C4bp during purification) . 



